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II. TABLES 


I. EXPLANATION 


I. PROPORTIONATELY SMALL SAMPLES 


These Tables have been specially constructed for practical use in 
sanitary, pathological and clinical work. 

Suppose that we are studying things of any nature, such for 
example as men, cases of sickness, insects, leucocytes, trypanosomes, 
seeds, stones, etc, and wish to ascertain what proportion of all of 
them belong to a particular class, that is, have a given characteristic. 
Then we can answer this question with absolute certainty only in one 
way, namely, by examining the whole number of such things that 
exist in the region under consideration. But this is generally 
impossible; and we must, therefore, content ourself with examining 
only as many of the things as we can—ascertaining what proportion 
of these possess the given characteristic, and thence zzferring what 
probable proportion of the same things in the region under 


consideration belong to the same class. 


* To be obtained as a separate publication for two shillings and sixpence, postage 
included, from the Clerk of the Laboratory, School of Tropical Medicine, University. 


Liverpool. MI rights reserved. 
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For ¢xample, if we wish to know with certainty how many people 
of any natonality have blue eyes, then we must examine all the 
people of thatnationality. But this will be impossible. Hence we 
must examine 10,0F 100, or 1,000 or more of the people; ascertain 
how many of ¢hese have blue eyes; and then izfer from this sample 
what proportion of the whole ration are Zzkely to have them. 

Everyone knows, of course, fat. (provided the methods of 
examination are always equally careful anl.trustworthy) our estimate 
is more likely to be near the truth if we examima large sample than 
if we examine only a small one-—that is. if we examne. many of the 
things than if we examine only a few. For instance, it vould be 
absurd to attempt to estimate the proportion of persons with blueyes 
by examining only five or ten persons. We should come nearer>y 
examining one hundred or one thousand, and so on; but we shoul 
reach absolute certainty only by examining all the people in the 
country. The important question now arises: How many of the 
things must we examine in order to become reasonably sure that 
our result is within a given percentage of the exact truth? The 
labours of mathematicians enable us to answer this question, and the 
following Tables enable us to answer it in most cases without 
calculation. We must begin by dealing with the case in which the 
total number of things is so large that we cannot take for a sample 
more than a very small proportion of that total number. Hence the 
heading of this section is Proportionately Small Samples. 

First, however, we must understand what exactly we mean by 
the phrases ‘reasonably sure’ and ‘a given percentage of the 
truth” Both these phrases contain ideas of degree. Thus, if we are 
carrying out a strict scientific enquiry, we may wish to be able to 
say that the betting (or probability, as it is called) is 99,999 to 1 that 
our result is within one per cent. of the truth. We must then 
consult Table A 1, under the second column. Or it may suffice to 
say that the betting is 99,999 to 1 that our result is within five per 
cent. of the truth; and we then consult the same table under the 
sixth column. But it may happen that we can afford to be content 
with a lower degree of probability than this—it may suffice to say 
that the betting is only 99 to 1 that our result is within 3 per cent. of 
the truth; and we then look at Table A 4, under the fourth column. 
Lastly, we may be allowed to content ourself with an ‘even chance’ or 
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‘toss up'—that is, a probability of 1 to 1; and for this we consult 
Table A 6. 

The reader will now be easily able to understand the tables. The 
figures in the heading of each give the degree of sureness (so to 
speak) with which we may rely upon the inferences drawn from the 
table. Thus the odds are 99,999 to 1 that the inferences to be 
drawn from the first table are sound; or, in other words, out of 
100,000 trials of the table only one is likely to be wrong. Table 
A 5, however, is likely to be wrong once in ten trials; and Table 
A 6, once in two. 

The percentages at the head of each column give the percentage 
of ‘ statistical error '—that is, the amount by which the truth is likely 
to diverge from our observed result. Thus, if our result is 70 per 
cent., and the statistical error is 5 per cent., then the odds (as 
denoted by the figures 99,999 to 1, etc.) are that the truth will lie 
anywhere between 70 + 5 per cent. and 70 — 5 per cent.—that is, 
between 75 per cent. and 65 per cent. If, however, the statistical 
error is only one per cent., then the truth is likely to lie between the 
narrower limits, 71 per cent. and 09 per cent. If the observed 
result, 2/4s the error, exceeds 100, or if the observed result, minus the 
error, is less than 0, we conclude that the number of things hitherto 
examined 1s not yet large enough to yield a useful result. 

The figures down the first (left hand) column of each table refer 
to the percentage of the observed result up to 50 per cent.; and the 
figures on the same line in the body of the table give the total number 
of things which must be examined in order to ensure that that result 
will lie within the percentage of error given at the head of the 
corresponding column, with the degree of probability given at the 
head of each table. Thus, if the observed result is 43 per cent., we 
must examine 47,780 things to ensure that the odds are 99,999 to | 
that the truth lies between 44 per cent. and 42 per cent. But 
we need examine only 1,328 things to ensure that the betting is 
99,999 to 1 that the truth lies between 49 per cent. and 37 per 
cent. We must examine 26,541 things to ensure, with a probability 
of 999 to 1, that the truth lies between 44 per cent. and 42 per cent. 
(Table A 3); but we need examine only 415 things to ensure, with a 
probability of g to 1, that the truth lies between 47 per cent. and 39 
per cent. (Table A 5); and only 185 things that, with a probability 
of g to 1, it lies between 49 per cent. and 37 per cent. 
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If the observed percentage is over 50, we subtract it from 100 
and obtain the required figures for the remainder. For example, the 
figures for observed percentages of 70 per cent., 79 per cent., and 85 
per cent. are precisely the same as those for 30 per cent., 21 per cent. 
and 15 per cent. 

On inspecting the tables we see at once that the figures diminish 
rapidly in successive tables, and also in successive columns (from left 
to right). From these observations we gather, as we might have 
expected, that the number of things required to be examined 
diminishes (a) with reduced probability of correctness, and (8) with 
increased statistical error. l 

The figures increase as we descend the columns. That is, the 
number of things required to be examined increases as the observed 
percentage rises from 1 per cent. to 50 per cent. ; but after that (by 
the rule just given) it diminishes as the observed percentage continues 
to rise from 50 per cent. to 100 per cent. 

The tables are calculated for only six degrees of statistical error, 
namely, from i per cent. to 6 per cent. But it is easy to obtain the 
figures for any required degree, simply by dividing those in the 
column for 1 per cent. error by the square of the required degree. 
Thus for a probability of 99,999 to 1, and an observed percentage of 
43, we must examine 478 things for a statistical error of 10 per cent., 
and 4,778,000 things for a statistical error of 1-10th per cent. It 
will be seen that the columns for 2 per cent., 3 per cent., etc., follow 
this rule. 

It is also easy to obtain approximately the figures for fractional 
observed percentages, such as 2'5 per cent., or 27°3 per cent., because 
the increase in the number of things required to be examined is, 
roughly, proportional to the increase of the observed percentage. 
Subtract the next lower from the next higher figure in the table; 
multiply the remainder by the decimal fraction, and add the result 
to the next lower figure. Thus in Table A 1, in the column for 
1 per cent., the figure opposite 2°5 per cent. observed percentage will 
be about 4,747; and opposite 27°3 per cent. will be about 38,086. 
But such refinements will rarely be needed. 

We are often obliged to examine such a small number of things 
that the error is evidently greater than the 6 per cent. calculated for 
in the Tables, and we may wish to know exactly how much it is. In 
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this case proceed as follows: Select the degree of probability 
required, and in the appropriate table look out the observed 
percentage actually obtained. Take the number opposite to this in 
the column for 1 per cent. error, and divide it by the number of things 
actually examined. The error will be the square root of the quotient. 
Thus, suppose that we have examined only 300 leucocytes, and have 
found 45 per cent. of these to be mononuclears. The number we 
ought to examine for a I per cent. error at a betting of 0,999 to 1 is 
37,459. The ratio of this number to 300 is 125; and the square root 
of this is 11:2. Hence the statistical error of our work at this 
betting is 11°2 per cent. But 6,697 leucocytes would have sufficed 
at a betting of only 9 to 1. The ratio of this to 300 is 22°32; 
so that the statistical error at 9 to 1 is only 4'7 per cent.—as could 
have been roughly inferred from Table A 4. l 

The reader should note the large number of things which must 
be examined before a result can be obtained to any high degree 
of probability and within narrow limits of error ; and he will doubtless 
remember many confident assertions based upon much smaller 
samples. 


EXAMPLES 

1. How many persons of one nationality must be examined 
before we can assure ourselves, to a probability of 99,999 to 1, that 
from 65 per cent. to 67 per cent. of all the nation do not possess 
blue eyes? Answer: 43,744. 

2. How many of a patient’s leucocytes must be examined before 
we can bet 999 to 1 that between 42 per cent. and 40 per cent. of 
all his leucocytes are mononuclear ?* Answer: 26,194. 

3. How many of his leucocytes must be examined before we can 
bet about 100 to 1 that 69 per cent. to 65 per cent. of all his leucocytes 
are ‘polynuclear’? Answer: 3,668. 

4. How many of his leucocytes must be exainined before we can 
bet 9,999 to I that his eosinophile leucocytes number between 1'5 per 
cent. and 2°5 per cent. of his total leucocytes? Answer: 11,868 
(Multiply figure in column for 1 per cent. in Table A 2 by 4). 

5. On examining 100 of his leucocytes we find that 7 per cent. 


* Always supposing that the leucocytes are evenly distributed throughout the 
circulation. 
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of them are large mononuclears. How many more leucocytes 
must we examine before betting «40/4 10,000 to 1 that the same 
percentage holds for all the leucocytes in his body within an 
error of 3 per cent.? Answer. 008. 

6. On examining 65 case-records we find that 13 of the patients 
died. How many more case-records must we examine before betting 
99 to I that the case-mortality of the disease is between 10 per cent. 
and 21 per cent? A@swer- 10,551. 

7. Our new line of treatment has cured one out of five cases of a 
hitherto incurable disease. How many more cases must we treat 
before betting 999 to 1 that we can cure between 15 per cent. and 
25 per cent. of all cases? Answer, 688. 

8. On examining nearly 2,000 of a patient's red corpuscles, we 
find that 8 per cent. of them are nucleated, and bet 9 to 1 that the 
percentage of nucleated red-corpuscles in his whole body is— what? 
Answer: Between 7 per cent. and 9 per cent. 

9 On examining 220 things, we find that 31 of them belong to 
a particular class. What is the statistical error at a probability of 
99 to 1? Answer: 06 per cent 

10. On examining 138 leucocytes we find that 15 per cent. of 
them are large mononuclears. What is the proportion of large 
mononuclears in the whole body, at a betting of about 1,000 to 1? 
Answer: Anything between 5 per cent. and 25 per cent. (Find the 
figure for 10 per cent. error.) 

II. On examining 500 malaria parasites, we find that 14 per 
cent. of them are sexual forms. What is the betting that the 
statistical error is about 1 per cent.? Answer: An even chance. 

12. In the same case, what is the betting that the error is not 
greater than 4 per cent.? Answer; 90 to I. 

13. Out of 200leucocytes we find 23 per cent. to be mononuclears. 
What is the statistical error at a betting of 999 to 1? Answer: 979 
TORE. 

200 

14. Next day, in the same patient, out of the same number of 
leucocytes, we find 40 per cent. to be mononuclears. What is the 
statistical error at the same betting? Answer: 11°4 per cent. 

15. Can we bet 999 to 1 that there has been an increase of 
mononuclears in this case? Answer. No, because the errors overlap ; 


per cent. (Square root of 
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that is, the difference between 23 and 40 is less than the sum of 979 
and 114. 

16. Can we bet 99 to 1 that there has been an increase? 
Answer; Yes, because with this lower degree of probability the sum 
of the errors, namely 7:7 and 8'9, is less than the difference between 
the observed percentages, 23 and 40. 

17. Working at a probability of 999 to 1, we find that out of 
100 leucocytes two are eosinophiles. What is the error? Answer: 
Between 4 per cent. and 5 per cent. What are we to conclude? 
Answer: That we must examine more leucocytes until the error is 
at least less than the observed percentage. 

18. On examining 136 things we find about 10 per cent. to 
belong to a particular class. What, roughly, is the error at a betting 
of 9,999 to 1? Answer: 10 per cent. (Find the square root of the 
quotient of 13,621 divided by 136.) 

19. On examining 200 things we find 80 of them belong to a 
particular class. What is the error at a betting of 99,999 to 1? 
Answer; 153 per cent. (Divide 40,785 by 200, and find the square 
root of the quotient.) 

20. Working at a probability of 9,999 to 1, and an error of 1 
per cent., how many things must we examine in order to assure an 
observed percentage of 417? Answer: About 36,789. 


1. THE CORRECT PROCEDURE IN PRACTICAL WORK 


The Tables will, then, be of practical use in many kinds of sanitary 
and medical work; as, for instance, in estimating the frequency of 
death, or of some symptom in a given disease; or of some symptom, 
such as enlargement of the spleen or rickets, in a population; or in 
making differential counts of leucocytes in a patient, or of colonies 
of bacteria growing on a plate culture. But an cxamination of 
the Tables will convince us that the procedure now generally adopted 
in attempting such estimates is very faulty, because observers seldom 
trouble much regarding that all-important point, the size of the 
sample—that is, the total number of things which they must cxamine 
in order to obtain a sufficiently correct result. Certainly, often 
(though not always), they recognise that the sample must be large ; 
but they usually fix its size quite arbitrarily—as, for instance, when 
they say beforehand that 200, or 500, or 1,000 leucocytes must be 
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examined for differential counts. This, however, may lead to the 
most untrustworthy results, because, as we have seen, the size of the 
required sample is not fixed, but depends on several factors, 
including the observed percentage of things of the particular class— 
that is, the very percentage which we are seeking to ascertain. We 
cannot, therefore, fix the size of the sample beforehand, but must do 
so as we proceed in the work. 

The size of the sample depends upon three factors, namely : 

(1) The degree of sureness which we have to attain ; 

(2) The percentage of statistical error, or degree of accuracy, 
which may be allowed ; and 

(3) The observed percentage of things of the particular class 
which we are endeavouring to ascertain. 

The correct procedure is, therefore, as follows :— 

(1) First decide definitely as to the degree of sureness which must 
be attained, and the percentage of error which may be allowed. 
These will depend upon the importance of the work and the time 
which we can devote to it. For strict scientific or large sanitary 
investigations we may require a very high degree of sureness, say, 
99,999 to 1, and very small limits of error, say 1 per cent. And this 
will be specially the case when we have to compare results obtained 
at different times; as, for instance, when we wish to know whether 
the mononuclear leucocytes increase with the progress of a disease 
(see examples 13-16), or whether an epidemic is diminishing. Here it 
is absolutely essential that the statistical errors obtained at the two 
different times are not large enough to overlap. On the other hand, 
we may often be permitted to adopt lower degrees of sureness and 
high percentages of error, especially when we are merely seeking 
some corroborative evidence or when differences between successive 
estimates are so large and striking that even a large percentage of 
error cannot mislead the judgment. Here, as in regard to the 
following paragraph, we must often be guided by the progress of the 
work. But, as soon as we decide upon these points, we can 
determine which table, and which column in that table, are to be 
used: 

‘ (2) Secondly, before fixing upon the size of the sample, we 
should endeavour to obtain by trial a rough estimate of the observed 
percentage of things of the particular class which we are studying. 
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Suppose, for example, that we have to make a differential count of 
leucocytes. Then we do not wish, on the one hand, to allow too 
much statistical error, or, on the other hand, to waste time over 
examining too large a sample. Suppose, first, that the 9 to 1 Table 
is sufficient. Begin by examining 100 leucocytes. Suppose that 
jo per cent. of these are mononuclears. Then we can see at once 
from the Table how many leucocytes must be examined to give a 
reliable result at that observed percentage. If a 6 per cent. error 
will suffice, we anticipate that we shall require to examine only SI 
more leucocytes. If a 2 per cent. error must be obtained we shall 
have to examine 1,524 more leucocytes. «As we now proceed in the 
task we shall find that the observed percentage changes considerably 
when we have examined 200, 300, 400 leucocytes, and so on (we 
should calculate the percentage, not for each successive batch of 100 
leucocytes, but for the total number examined from the beginning). 
Finally, when about 1,400 leucocytes have been examined, we 
anticipate that we are approaching the required limit (for 2 per cent. 
error). Suppose that at 1,559 leucocytes the observed percentage 
stands at just about 36 per cent—thus agreeing with the Table. 
We then stop; having obtained a 30 per cent. ratio, with a betting 
of 9 to I, and a statistical error of 2 per cent. That is, the probability 
is Ọ to I that the truth lies between 38 per cent and 34 per cent.; 
and we have not wasted time in reaching this result. 

If, however, we require high degrees of probability, or iow degrees 
of error, or both, there will be little use in attempting the preliminary 
rough estimate by a small sample of only 100 leucocytes, and we 
had better take for it at once 500 or 1,000 or more as the case might 
be. 

(3) Of course, in differential leucocyte counts we often possess 
beforehand some inkling of what observed percentage we are to 
expect. Thus, the eosinophiles are generally few in number, and 
the ‘polynuclears’ numerous; and we judge roughly regarding the 
size of the sample accordingly. The same thing usually happens 
in other kinds of enquiry. 

(4) It is a great mistake to suppose that a large sample will 
compensate for inaccurate working. These Tables are based on 
the supposition that each thing examined has been accurately 
assigned to its proper class. Things which cannot be certainly 
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assigned to their proper class must be rejected; but the number of 
them must be noted, and the proportion which they bear to the 
whole number of things must be afterwards determined, with 
estimates of probability and error, by precisely the same methods 
as those described. They constitute, in fact, a third class by 
themselves. 

(5) If the things under study can be divided into three or more 
classes, determine separately the proportion of each class to the 
whole. This does not necessarily require different series of 
investigations. We simply extract the figures from the records; 
but care must be taken that the samples are sufficient for each class 
by itself. 

(6) If while examining successive samples of things (such as 
leucocytes) we find that the observed percentage in each sample in 
succession tends always in one direction, that is, either to increase or 
to decrease, then we may suspect that some influence other than 
mere chance is at work. The number of successive samples required 
to verify such a suspicion will depend upon the nature of the material, 
and might be large; but in many cases if the observed percentage 
always increases, or always diminishes, in at least five successive 
samples, then we may have grounds for further enquiry upon the 
point, or for reference to a trained statistician. 

(7) The probability or degree of betting which is generally 
accepted by statisticians as amounting almost to certainty is 49,999 
to 1. The figures for this can be obtained by multiplying the 
corresponding figures for a betting of 9,999 to 1 (Table A 2) by the 
factor 1189; or, what is nearly the same thing, by increasing the 
figures in Table A 2, by 20 per cent. 

(8) Great care must always be taken that samples are chosen 
really at random, and are not selected with any conscious or 
subconscious bias. 


3. PROPORTIONATELY LARGE SAMPLES 


We have hitherto dealt with Comparatively Small Samples—that 
is, with samples which are small compared with the total number of 
the things in existence. For instance, if we are studying all the 
people in a country or all the leucocytes in a patient's body, we shall 
seldom be able to examine more than a very small proportion of 
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these people or leucocytes. But there are cases when the total 
number of things is not so very large that we cannot examine a very 
considerable proportion of them. Suppose, for instance, that we 
wish to ascertain the proportion of children with enlarged spleen, 
not in the whole world, but in a large village ; and suppose that there 
are 200 children in the village, but that we have time to examine 
only 50 of them. Here the sample is comparatively large, being one 
quarter of all the children in the village. Or suppose that we can 
examine 180 of the children; here the sample is so large that it 
approaches the whole number of things under study (i.e, the children 
in the village). Obviously, on examining these large samples we 
shall approach much nearer to the exact truth than would be 
anticipated from the Tables. In the second case, for instance, we 
should have to examine only 20 more of the children in order to 
obtain absolute certainty (provided that the examinations are careful 
enough). Hence, clearly, the Tables must be corrected for the case 
of Large Samples. A very suitable and easily applicable method 
for this purpose is to multiply the statistical error given in the 
Tables by the Factor. 
YA a 
Sa NEG 


where # is the number of things in the sample, and V is the total 
number of things under study. 
Examining this Correction Factor, we see that when x is very 
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small compared with V, the term becomes so small that it 
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may be negiected; so that the Factor now becomes the square root 
of unity; that is, unity. This multiplied into the statistical errors 
given m the Tables does not modify them at all—so that the Tables 
are then quite correct. Here we have, of “course, the case of 
Comparatively Small Samples, where V is a very large number, 
such as all the people in a country or all the leucocytes in a person. 


Again, if „=N, that is, when the sample includes all the things 
1 
N= 1 
zero. This multiplied into the statistical errors makes them vanish. 
In other words, there is no statistical error because we have reached 


in existence, the term equals unity, and the Factor becomes 


certainty. 
Between these values the Factor is a vulgar fraction, which can 
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easily be calculated. Thus, if z =50 and Y <= 200, the Factor is the 
square root of = : that is, 0868. This reduces the statistical 


error, but not much. If = 180, the Factor is the square root of 
TE : that is, o°317—which reduces them considerably. 
If the total number of things is at all considerable—say over 20-- 


then the Factor becomes nearly the same as the square root of 
n 


n In this case we can give a table of the various values of 


£ i A n 
the Factor which correspond to the various values of SO. In the 


following Table the proportion of things examined to total things, 


Ma. : 
y » 1S given in percentages, and the corresponding values of the 


Factor are put below : 


Tabu B 
n 
100 = 5 10 15 20 25 30 ag i LE 50 
Factor = 097 095 0:92 O89 O87 0:84 osr 077 0:74 0:71 
100 3, = EE 60 65 79 75 80 95 go 95 Too 
Factor = 067 0:63 0:59 O55 0:50 O45 0°39 0°32 0:22 0:00 


Thus, if we examine half the total things, the statistical errors are 
reduced to about seven-tenths of the values given in Tables A. It 
will be seen that this Factor makes little difference in the statistical 
error unless the number of things examined is more than one-tenth 
of the total. 


EXAMPLES 

21. Nine out of 145 children in a village are absent. On 
examining the remainder we find 14 (say 10 per cent.) of them with 
enlarged spleen. What is the error at a betting of 9,999 to 1? 
Answer: 2°5 per cent. (The Factor is 1/4. See also Example 18.) 

22. There are 1,000 people on an island. Out of 100 of these 
60 are found to be affected with filariasis. What is the statistical 
error at a betting of gto 1? Answer: About 77 per cent. 

23. Out of 884 people in a town, three-quarters are examined, 
and 73 of these are found to be in bad health. How many of all 
the people in the town are likely to be unwell? Answer: Between 
9 per cent. and 13 per cent., at a betting of 999 to 1. 
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4. THE NUMBER OF THINGS OF ONE CLASS EXISTING 
IN A GIVEN AREA, BULK, OR TIME 


Suppose that we wish to know the number of things of one kind 
contained in a given area, bulk, or time. Then the only way to 
ascertain this with certainty is to count all the things. Suppose, 
however, that we have no time for this, and must content ourself with 
counting the things in a measured sample, and then estimating from 
this observed result the most probable number of the things in the 
whole area, bulk or time. The question then arises: How large 
must the sample be in order to reduce the statistical error, with a 
given degree of probability, to below a given percentage? 

For example, suppose that we wish to know how many separate 
stones there are in a million cubic feet of gravel. We cannot count 
them all, and must, therefore, content ourself with counting how 
many there are in a sample of, say, one, two or more cubic feet. 
In how many cubic feet, then, must we count the separate stones 
in order to be able to calculate the total with the required 
degree of accuracy? Or suppose that we wish to know how many 
leucocytes or parasites there are in the total blood of a patient, then 
m how much of his blood must we count these objects in order that 
the most probable truth will lie between sufficiently narrow limits? 
Shall we take one, two, or more cubic millimetres of his blood for 
our sample ? 

First, in order to use the sampling method at all, we must 
know that the things are equally distributed throughout the 
area, bulk, or time. If this is not the case, we cannot know that our 
sample accurately represents the whole material. For example, it 
would be useless to attempt to estimate the population of Britain 
by counting all the people in one-tenth of the area of the country 
because the population is not distributed equally at random 
everywhere, but is gathered specially into certain districts and cities 
according to certain economic laws. Similarly, we cannot estimate 
by taking samples of the peripheral blood how many blood parasites 
there are in the whole body unless we know that these organisms 
do not collect specially in certain parts of the circulation. 

But--it may be asked if the things are equably distributed, 
what further trouble will there be? We have only to count the 
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number found in any sample, and then to multiply that figure by 
the total number of samples contained in the whole area, bulk, or 
time. If the stones are equally distributed in a million cubic feet 
of gravel, then there will be exactly a million times as many in the 
whole mass of gravel as there are in one cubic foot. But this is 
not so. It may be that dy chance the stones in the first cubic 
foot taken as a sample are exceptionally large, and therefore 
are exceptionally few. Or it may happen by chance that the 
trypanosomes in a first sample of blood are exceptionally 
numerous, or exceptionally few, as the case may be. We shall then 
form a totally wrong estimate if we trust merely to the simple but 
untruthful method just mentioned. 

To obtain accurate estimates by any method in cases like these 
may require the services of a trained statistician, and also, often, 
a special study of the kind of material under consideration— 
especially to ascertain whether the things are really equably 
distributed. But for the purposes of this Article the following 
Table will often be useful, because it serves to give some idea of 
the number of things which must actually be counted if they are 
equably distributed throughout the whole area, bulk, or time. 


TABLE C 
PROBABILITY | STATISTICAL ERRORS 
| K F | J 
P se 3% | 4% | 5% | 10% | 20% 
I tot UE 1,138 506 285 182 46 II 
gtor | 27,057 6.764 3,006 | 1,691 | 1,082 271 63 
99 to I | 66,350 16.588 7.372 | 4,147 2,654 664 166 
999 to 1 | 108,284 27,071 12,032 6,768 ESRI 1,083 271 
9,999 to 1 | 151,338 373835 16,815 9,459 6,053 1,513 378 
99,999 to 1 | 194,938 | 48,733 | 21,660 | 12,184 7,797 1,949 | 487 


Suppose, for example, that a cubic foot of gravel has been found 
to contain 3,006 stones, then the most probable number of stones in 
a million cubic feet of the same gravel will be 3,006 millions, within 
an error of 3 per cent., and at a betting of gto 1; that is, we may 
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bet 0 to 1 that the most probable total number of stones in the 
million cubic feet will lie between about 3,096 and 2,916 millions. 
Or suppose that 271 trypanosomes have been found in one cubic 
millimetre of blood in a patient weighing 64°74 kilogrammes 
(142 lbs., or about 10 stone English), who should contain about 
3,000,000 c.mm. of blood altogether; then we may bet 9 to 1 that 
the most probable number of trypanosomes in the whole of his 
blood will lie between 894,300,0co and 731,700,000 (10 per cent. 
error). And if we have counted 4,200 leucocytes in the c.mm. of 
blood, we may bet 9,999 to 1 that his total blood contains between 
13,356 and 11,844 millions of these cells (6 per cent. error). 

As stated in Section 1 (page 350) if we wish to know the number 
of things to be counted in order to yield an error within more than 
5 per cent. we have only to divide the figures under, the 1 per cent. 
column twice over by the required percentage (that 1s, by the square 
of the required percentage). Thus, for an error of 10 per cent. we 
divide by 100; and for an error of 31°6 per cent. we divide by 1,000. 
For example, if we find 27 malaria crescents in I c.mm. of the same 
patient’s blood, we may bet 9 to 1 that he contains between about 
106 and 56 millions of crescents altogether (always provided that 
they are equably distributed in the blood). 

The above Table is only for proportionately small samples, as 
defined in Section 1 (page 348); that is, for samples which are 
small compared with the total mass of material. When the sample 
is more than about one-tenth of the total material we should use 
the Correction Factor of Section 3 (page 357) for proportionately 
large samples. Thus, if 1,500 leucocytes have been counted in one- 
quarter c.mm. of blood, and we wish to calculate the number in 
I cmm, then the error by the above Table is about 10 per cent. at 
a betting of 9,999 to 1. But by the Table on page 358 the Correction 


Factor is 0°87 when ` equals one-quarter, or 25 per cent. Thus 
ihepemo is not! 19 per cent, but &'z per cent, and we may bet 
9.990 to 1 that the number of leucocytes in 1 e.mm. of blood is 
hetween 6,522 and 5,478. But for the total blood content of 
3,000,000 c.mm. the error of 10 per cent. must be maintained, 
because the one-quarter cinm. is now a ‘proportionately small 
sample. This gives the number of leucocytes in the whole body 
as most probably lying between 19,800 and 16,200 millions. 


302) 


We can use the Table in another way. Suppose that we have 
counted 664 things in a sainple. Then the error 15 10 per cent. on 
a probability of 99 to 1. Hence we can bet 99 to 1 that all counts 
in future samples of the same size will he between 7304 and 5976— 
assuming that the sample is proportionately small. This way of 
stating the case avoids the necessity of determining exactly the size 
of the sample compared to the whole material. In blood counts, 
for instance, we are concerned with the number of things in unit 
of blood rather than in the whole body, and we often wish to know 
whether this number is increasing or diminishing. If the number 
of things in a second sample is outside the limits of error declared 
from the first sample, we may assume, at the appropriate probability, 
that there has been an increase or decrease, as the case may be. 
If otherwise, the difference may be due merely to chance in the 
taking of the samples, and not to any real change in the total number 
of things in the whole body. 


EXAMPLES 


24. We have counted 4,250 red corpuscles in one-thousandth of 
a cubic millimetre of blood. What is the most probable number in 
one cubic millimetre, at a betting of 99 to 1? Answer. Between 
about 4,420,000 and 4,080,000. 

25. How many may we expect to find in a second sample of 
the same size, at a betting of 999 to 1? Answer: Between about 
4,403 and 4,037. 

26. A week ago we found 490 red corpuscles in one-ten- 
‘thousandth of a cmm of a patient’s blood. To-day we find only 
294 in the same sized sample. May we bet 99,999 to 1 that there 
has been a decrease? Answer: No, the errors overlap. May we 
bet 9 to 1 that there has been a decrease? Answer: Yes. 

27. Blood has been diluted 100 times. Ln teth of 1 c.mm. 
of the mixture we found 500 red corpuscles. How many do we 
expect to find in 1 c.mm. of the blood at a probability of 909 to 1? 
Answer: 5,000,000, with an error of 14°77 per cent. 

28. We have found 553 things in a sample. In how many out 
of 100 similar samples of the same material should we expect to find 
an error greater than 7 per cent.? Answer: In ten. 

29. A newly-appointed official finds that he is obliged to write 
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150 letters during his first week of office. How many letters should 
he expect, at a betting of 999 to 1, to have to write at the same rate 
Every week in the future? LEE 150, with an error of 26°37 
per cent. 

30. We have found one filaria embryo in 1 cmm. of blood. 
What may we infer regarding the total number in the whole 
circulation of 3,000,000 c.mm.? Answer; The odds are 1 to 1 that 
the error is less than 67's per cent., and that the total number of 
embryos in the circulation lies between 5,025,000 and 975,000. In 
one ont of two such cases the error may exceed this amount. 
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II, TABEES 


A 1. 99999 to I 
Errors 

28 3 "a + 0 

483 205 IZI 

956 425 239 
1419 631 355 
1872 832 468 
DS Wo. 579 
2749 1222 688 
3175 bo jo 
3587 1595 897 
3998 T 998 
4387 1950 1097 
NGR 2121 1193 
5147 2288 1287 
5512 2159 1378 
5868 2608 1407 
6214 2762 1554 
6550 2012 1638 
6877 | 3057 1720 
BEE N 319) 1799 
EG 3334 1875 
7798 3466 1950 
8085 3594 20 
8363 agug 2091 
8631 3836 2158 
8889 3951 222; 
9138 4062 2285 
9377 4168 2345 
9606 4270 2402 
9825 1367 2457 
10035 4460 2509 
T0235 4549 2550 
10425 4634 2607 
10605 4714 2652 
10776 4789 2694 
10936 4861 2734 
11088 4928 2772 
11229 4991 2808 
11360 5049 2840 
11482 5104 2871 
11592 5152 2898 
11697 5199 2925 
11789 5240 2948 
11872 5277 2968 
11945 5309 2987 
eee 5337 pes 
12062 5361 3016 
12106 5381 3027 
12140 5396 3035 
12165 5407 3042 
TS 5413 3045 
ee TA 
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A2. 9999 to I 
Errors 

I Da 2 % 5 % | + % 

I 1499 | 375 167 | 94 
2 2967 742 330 186 
3 {404 110I 490 276 
4 5812 1453 646 304 
5 7189 1798 799 450 
6 8536 | 2134 949 534 
7 9852 2463 1095 616 
8 11139 2785 1238 | 697 
9 12395 | 3099 1378 775 
10 13621 3406 1514 852 
II 14816 3704 1647 926 
12 15982 3996 1776 999 
13 i ' 4280 1902 1070 
14 18221 4556 208R 1139 
15 19296 4824 2rd (I 1206 
16 20340 5085 2260 1272 
T 21354 3339 21 K339 
18 22338 5585 2482 1397 
19 23291 5823 7 2588 1456 
20 24215 6054 — 2691 1514 
21 25107 6277. | 2790 1570 
22 25970 6493 | 2886 162 
2 26802 6701 | 2978 1676 
2 | 27605 6902 3068 1726 
25 28376 7094 3153 1774 
26 29118 7280 3236 1820 
27 29829 7458 © 3315 1865 
28 30510 7628 3390 | 1907 
29 31160 7790 3463 | 1948 
30 31781 7946 3532 1987 
31 32372 8093 3597 202 
32 32932 8233 3660 2059 
33 33401 8366 3718 2092 
34 33960 8490 3774 2123 
35 34430 8608 3826 2152 
3 | 34868 8717 3875 2180 
37 i 352% 8820 3920 2205 
38 35655 8914 3962 2229 
39 36004 GOOI 4001 2250 
40 36322 9081 | 4036 2270 
#1 36609 9153 | 4068 2289 
42 36866 9217 4097 2305 
43 37993 2/4 AE og 
ttt EENS 2023 HHT Zool] 
45 37459 9365 4162 2342 
$6 37593 9399 4177 2350 
47 37699) 9425 $189 2357 
48 3777 9444 4198 2301 
49 37820 9455 4203 2364 
50 37835 9459 4204 2305 
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A3. 999 to I 


Errors 


wv 
NI 
NI 
LA 


E 


322 
382 
mi 
499 
555 
610 
663 
715 
766 
815 
863 
010 
955 
999 

1042 

1083 

1123 

1162 


TSTS 
1255 
1269 
1303 
1334 
1365 
1394 
IARE 
1448 
73 
or 
1519 
1540 
1560 
1578 
1595 
1611 
1625 
1638 
1649 
1659 
1668 
1675 
1682 
1686 
1689 
1692 


1692 


Nuit oN = 


DINI 


SO 


OTOH 
10211 


10616 
11007 
11386 
11752 
12104 
12441 
12767 
13078 
13376 
13682 
TSH 
ig 
14439 
14672 
14890 
15096 
15287 
15407 
15633 
15787 
15925 
16051 
16164 
16263 
16349 
16422 
16482 
16532 
16562 
16582 
16588 


AN dle 


165 
os 
183 
638 
788 
936 
1080 
1221 
1559 
IN ok 
1624 
1752 
L 1876 
1998 
2116 
2230 
234.1 
“HY 
2553 
2654 
2752 
2847 
2038 
3026 
JILI 
3192 
3270 
Oasi 
mee 
3484 
5545 
3610 
3668 
3723 
SATA 
3822 
3867 
S) 
3047 
3982 
wees 
4041 
4066 
4088 
4106 
4121 
HZ 
JIJI 
4146 
ile 
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ERRORS 
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A5. 0 to 
ERRORS 
eo TE, 
30 17 II 
59 S Be 
88 50 32 
116 65 42 
143 | si | 52 
170 | 96 62 
196 III 71 
222 125 80 
| 247 139 89 
271 153 | 98 
295 166 106 
318 179 | 115 
341 192 123 
362 204 | 131 | 
384 216 139 | 
405 228 146 
125 239 153 
444 250 160 
463 261 167 
482 271 174 
499 281 180 
516 291 186 
SDE 300 TS 
549 309 198 | 
564 318 203 
579 326 209 
593 I 334 “mo 
607 341 219 
619 349 223 
632 356 228 
644 362 232 
655 368 | 236 
665 374 | 240 
675 | 380 243 
684 | 385 246 
693 | 390 250 
701 395 253 
109 399 295 
716 403 258 
722511 406 260 
728 410 262 
733 412 264 
737 415 266 
741 417 267 
745 419 268 
747 421 269 
ii) fa 270 
751 $23 api 
12 423 27) 
Da b 271 
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ISI 
182 
184 


186 
187 


188 
188 
188 


AG 
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I toI 

ERRORS 
3 Yo 4% 5 0 6% 
5 7 2 I 
10 6 4 2 
15 8 5 4 
19 II 7 5 
i Tea 9 6 
29 16 IO 7 
33 19 12 5 
a7, 21 Le 9 
JI 23 15 10 
t5 26 16 II 
49 28 18 12 
EK 30 19 Lo 
57 42 21 14 
61 Jade 22 15 
04 36 23 16 
68 38 25 17 
71 40 26 18 
75 U S 19 
78 44 28 19 
SI 45 29 20 
8 47 30 21 
87 48 31 22 
go 50 32 22 
2 52 do 23 
25 L 34 si 
97 55 35 2 
100 5 36 25 
Tas 57 il 25 
LOT 59 37 20 
106 60 38 37 
108 61 39 27 
110 62 40 28 
112 03 40 28 
113 64 41 28 
115 65 4I 29 
116 66 41 29 
118 66 42 29 
119 67 42 30 
120 68 43 30 
121 68 43 30 
122 69 ef 31 
123 69 44 31 
12 70 44 31 
125 70 15 31 
ie 70 45 S 
126 71 45 3I 
120 71 {5 2 
fen 71 45 32 
120 71 45 z 
126 71 45 32 


